NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data

Weng, Pei-Yau; Hoang, Minh; Nguyen, Lam M; Thai, My T; Weng, Tsui-Wei; Hoang, Trong Nghia (December 2024, neurIPS)

Free, publicly-accessible full text available December 16, 2025
Offline Model-Based Optimization via Policy-Guided Gradient Search

https://doi.org/10.1609/aaai.v38i10.29001

Chemingui, Yassine; Deshwal, Aryan; Hoang, Trong Nghia; Doppa, Janardhan Rao (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

Offline optimization is an emerging problem in many experimental engineering domains including protein, drug or aircraft design, where online experimentation to collect evaluation data is too expensive or dangerous. To avoid that, one has to optimize an unknown function given only its offline evaluation at a fixed set of inputs. A naive solution to this problem is to learn a surrogate model of the unknown function and optimize this surrogate instead. However, such a naive optimizer is prone to erroneous overestimation of the surrogate (possibly due to over-fitting on a biased sample of function evaluation) on inputs outside the offline dataset. Prior approaches addressing this challenge have primarily focused on learning robust surrogate models. However, their search strategies are derived from the surrogate model rather than the actual offline data. To fill this important gap, we introduce a new learning-to-search perspective for offline optimization by reformulating it as an offline reinforcement learning problem. Our proposed policy-guided gradient search approach explicitly learns the best policy for a given surrogate model created from the offline data. Our empirical results on multiple benchmarks demonstrate that the learned optimization policy can be combined with existing offline surrogates to significantly improve the optimization performance.
more » « less
Full Text Available
AID: Active Distillation Machine to Leverage Pre-Trained Black-Box Models in Private Data Settings

https://doi.org/10.1145/3442381.3449944

Hoang, Trong Nghia; Hong, Shenda; Xiao, Cao; Low, Bryan; Sun, Jimeng (April 2021, The Web conference)
null (Ed.)
This paper presents an active distillation method for a local institution (e.g., hospital) to find the best queries within its given budget to distill an on-server black-box model’s predictive knowledge into a local surrogate with transparent parameterization. This allows local institutions to understand better the predictive reasoning of the black-box model in its own local context or to further customize the distilled knowledge with its private dataset that cannot be centralized and fed into the server model. The proposed method thus addresses several challenges of deploying machine learning (ML) in many industrial settings (e.g., healthcare analytics) with strong proprietary constraints. These include: (1) the opaqueness of the server model’s architecture which prevents local users from understanding its predictive reasoning in their local data contexts; (2) the increasing cost and risk of uploading local data on the cloud for analysis; and (3) the need to customize the server model with private onsite data. We evaluated the proposed method on both benchmark and real-world healthcare data where significant improvements over existing local distillation methods were observed. A theoretical analysis of the proposed method is also presented.
more » « less
Full Text Available
Literature mining for alternative cementitious precursors and dissolution rate modeling of glassy phases

https://doi.org/10.1111/jace.17631

Uvegi, Hugo; Jensen, Zach; Hoang, Trong Nghia; Traynor, Brian; Aytaş, Tunahan; Goodwin, Richard T.; Olivetti, Elsa A. (February 2021, Journal of the American Ceramic Society)

Abstract Efforts to reduce the carbon footprint associated with cement and concrete production have resulted in a number of promising lower‐emission alternatives. Still, research has emphasized a small subset of potentially useful precursor materials. With the goal of expanding the precursor pool, this work presents results of parallel literature mining and rate modeling activities. As a result of literature mining, materials with appropriate SiO₂, Al₂O₃, and CaO concentrations were assembled into a comprehensive, representative ternary diagram. 23 000+ materials were extracted from 7000 journal articles, and 7500 materials from 6000 articles with 80 ≤ SiO₂ + Al₂O₃ + CaO ≤105 wt% automatically classified. Both supervised and semi‐supervised models were used for dissolution rate prediction of glassy materials with all models pulling from a single data set (n = 802 reported dissolution rates from 105 different glasses). Supervised modeling utilized linear and decision tree regressions to determine features most predictive of dissolution rate, resulting in log‐linear relationships between rate and pH, inverse temperature (1/K), and non‐bridging oxygen per tetrahedron (NBO/T). Semi‐supervised modeling was observed to be more robust to broader feature inclusion, providing similar predictive ability with a relatively larger set of descriptive features. Most importantly, results indicated that models trained on data from disparate scientific communities were adequately predictive (RMSE ≈ 1), particularly under pH ≥7 conditions relevant to the cement and alkali activation communities.
more » « less

Search for: All records